Learning to Generate Semantic Annotation for Domain Specific Sentences

Authors

  • Jianming Li
  • Lei Zhang
  • Yong Yu
Abstract

Vast numbers of web pages on the Internet contain free text in natural language that can only be read by human beings. To be understandable to machines, these pages should be annotated with semantic markup. Manually annotating large numbers of pages is an arduous task, which has made automatic semantic annotation an urgent challenge. In this paper, we propose a machine-learning-based automatic annotation approach. This approach can be trained for different domains and requires almost no manual rules. The annotation is at the sentence level and is in RDF format. We adopt a dependency grammar, Link Grammar [2], for this purpose. The ALPHA system, a prototype of this approach, has been developed with IBM China Research Lab. We expect that many improvements to this approach are possible, and our work may be selectively adopted or enhanced.
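To make the idea of a sentence-level annotation in RDF more concrete, the following is a minimal sketch and not the ALPHA system's actual output or method. It assumes a hypothetical namespace (`http://example.org/annotation#`), hypothetical relation names (`hasAgent`, `hasObject`, `hasModifier`), and hand-written word links standing in for what a dependency parser might produce. Each link is serialized as one line in N-Triples syntax.

```python
# Hypothetical namespace, used only for illustration.
BASE = "http://example.org/annotation#"


def links_to_ntriples(links):
    """Turn (head, label, dependent) word links into N-Triples lines."""
    triples = []
    for head, label, dependent in links:
        subject = f"<{BASE}{head}>"
        predicate = f"<{BASE}{label}>"
        obj = f"<{BASE}{dependent}>"
        triples.append(f"{subject} {predicate} {obj} .")
    return triples


# Word links for the sentence "The printer supports duplex printing".
# They are written by hand here; the relation names are assumptions,
# not labels taken from Link Grammar or from the paper.
links = [
    ("supports", "hasAgent", "printer"),
    ("supports", "hasObject", "printing"),
    ("printing", "hasModifier", "duplex"),
]

ntriples = links_to_ntriples(links)
for line in ntriples:
    print(line)
```

In the approach described in the abstract, the mapping from parser output to RDF would be learned per domain rather than hard-coded as it is in this sketch.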

Similar resources

Domain Specific Automatic Question Generation from Text

The goal of my doctoral thesis is to automatically generate interrogative sentences from descriptive sentences in Turkish biology texts. We employ syntactic and semantic approaches to parse descriptive sentences. The syntactic and semantic approaches use syntactic (constituent or dependency) parsing and semantic role labeling systems, respectively. After the parsing step, question statements whose an...

Ontology Learning and Semantic Annotation: a Necessary Symbiosis

Semantic annotation of text requires the dynamic merging of linguistically structured information and a “world model”, usually represented as a domain-specific ontology. On the other hand, the process of engineering a domain ontology through semi-automatic ontology learning system requires the availability of a considerable amount of semantically annotated documents. Facing this bootstrapping p...

Learning to Generate CGs from Domain Specific Sentences

Automatically generating Conceptual Graphs (CGs) [1] from natural language sentences is a difficult task when using CGs as a semantic (knowledge) representation language for natural language information sources. However, up to now only a few approaches have been proposed for this task, and most of them either are highly dependent on one domain or use many manually constructed generation rules. In this...

Semantic Annotation of Resources of Distance Learning Based Intelligent Agents

This paper presents a system based on intelligent agents for the semantic annotation of learning resources, taking into account the context of training. Semantic annotation systems rarely treat existing semantic annotations in the field of distance education (e-learning). Most researchers in the field of education limit annotations to specific cases (teacher annotation, learner annotation, anno...

Publication date: 2001